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ADDRESS GENERATION FOR INTERLEAVERS IN TURBO ENCODERS AND DECODERS 



★ * * 

5 Field of the invention 

The present invention relates to techniques for 
generating addresses and was developed by paying specific 
attention to the possible application in inter leaving/de- 
interleaving modules. 
10 A preferred application of the present invention is 

in third generation mobile communication systems, and more 
to the point in the implementation of the turbo decoder 
mother interleaver described in the technical 
Specification 3 GPP TS 25.212 V3.5.0. (2000-12) Release 
15 1999. 

" Description of the prior art 
A turbo encoder is a component used in 
telecommunication systems with the^purpose of re-arranging 
the information to be transmitted^- with the possible 
20 addition of control codes, in such a way that the effect 
of impairment sources adversely affecting transmission can 
be compensated by means of complementary decoding at the 
decoder . 

Exemplary of telecommunication systems adapted for 
25 employing turbo encoding/decoding schemes are systems 

adopting the so-called Universal Mobile Telecommunications 

System (UMTS) standard. 

Key elements in turbo encoders/decoders (as defined 

e.g. in the 3 GPP TS 25.212 Specification) are the so- 
30 called interleaver/de- interleaver blocks. Operation of 

such a block essentially provides for the order of the 

elementary items of information (bits) included in a given 

block of length K to be changed according to a 

predetermined law. 
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In EP-A-1 195 910 an interleaver is disclosed for use 
in a turbo encoder for UMTS apparatus. The interleaver in 
question includes a register for updating and registering 
a plurality of parameters for setting an operating 
5 condition of the interleaver; a controller for generating 
a control signal for controlling an operation of the 
system by receiving the operating condition from the 
register; an address calculator for generating a finally 
interleaved address using an inter row permutation 

10 pattern, an intra row permutation pattern increment 
arrangement value and an intra row permutation basic 
sequence, provided by the register according to the 
control signal generated by the controller; and a data 
storage for sequentially storing data input to the turbo 

15 encoder and outputtiiig data corresponding to the address 
generated by the address calculator. 

Such an arrangement is thus based on an interleaver 
that includes a register for updating and registering a 
plurality of interleaving parameters (i.e. the number K of 

20 input data items, and those parameters designated R, C, p, 
li and TypeD) for setting an operating condition of the 
interleaver. This prior art device is therefore based on 
the assumption that these parameters are generated and 
obtained from outside the device. 

25 Therefore, such prior art document fails to teach how 

to implement an interleaver wherein the interleaving 
parameters can be dynamically and rapidly adapted, e.g. to 
take into account possible variations in the number K of 
input data items, which may in fact vary even with each 

30 packet being transmitted. 

In ""Low-Cost IP-blocks for UMTS turbo decoders" by 
Masera, Mazza, Piccinini, Viglione, Zamboni - Proceedings 
of ESSCIRC 2001, Villach, September 2001 an arrangement is 
disclosed for implementing in hardware form some of the 
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elements comprising an interleaver. Also this prior art 
document fails to provide directions useful in 
implementing an interleaver whose parameters can be 
dynamically and rapidly adapted and, more generally, an 
5 arrangement where the interleaving parameters can be 
directly and dynamically extracted from the information to 
be transmitted (essentially the value of k) . 

Objects and summary of the present invention 

The object of the present invention is thus to 

10 provide an address generation arrangement for use in 
interleavers/de-interleavers wherein the interleaving 
parameters can be directly and dynamically extracted from 
the information to be transmitted, thus permitting those 
parameters to be dynamically and rapidly adapted e.g. to 

15 possible variations in the number K of input data items to 
be interleaved/ de-interleaved. 

According to the present invention, such an object is 
achieved by means of an address generating system having 
the features specifically called for in the claims that 

20 follow. The invention also relates to a corresponding 
method of operation as well as to a turbo encoder/ decoder 
including the address generating arrangement of the 
invention as a part of an interleaver/de- interleaver 
block. Finally the inventions relates to a computer 

25 program product directly loadable in the internal memory 
of a digital computer (such as e.g. a processor) , the 
product including software code portions for performing 
the method of the invention when the product is nan on a 
digital computer. 

30 A preferred embodiment of the invention is an 

arrangement for generating addresses for interleaving/de- 
interleaving sequences including a given number (K) of 
items (e.g. in turbo encoders/decoders), the arrangement 
including at least one memory unit with a plurality of 
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records stored therein. Each record includes information 
signals indicative of a respective set of interleaving 
parameters corresponding to at least one value for the 
given number of data items (K) . 
5 Still preferably, each value for the given number of 

data items (K) identifies a corresponding set of 
parameters (R, C, p, v) for constructing a matrix (RxC) 
for arranging said the input sequence and effecting intra- 
row and inter-row permutation of the matrix to generate a 

10 permuted matrix for generating the output addresses. Sets 
of said parameters (R, C, p, v) are thus pron^tly 
available for all possible values of the given number of 
data items (K) . 

The Value of the given number of items (K) to be 

15 interleaved/de- interleaved is usually comprised in a given 
range of values, and the memory unit preferably includes a 
number of records that is substantially smaller than said 
given range of values. Each record in the memory unit is 
identified by a respective pointer. Associated with the 

20 memory unit there is a pointer retrieval circuit 
configured for generating for each value of the given 
number of data items in said given range a corresponding 
pointer pointing to a respective record in the memory 
unit - 

25 An embodiment of the invention permits a core for the fc 

generation of the interleaving addresses to be implemented 
both as a software solution with a low computational 
effort (using a commercial DSP or Micro Controller) , and - 
preferably - as an ASIC solution. This important feature 

30 leads to a stand-alone solution in the design of a turbo 
decoder processor used in the UMTS receivers. 

Brief description of the enclosed drawings 
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The invention will now be described / by way of 
% example only, by referring to the enclosed drawings, 

wherein: 

- Figure 1 is a general block diagram showing an 
5 interleaver implementation, 

- Figure 2 is a block diagram showing a first step of 
parameter generation in an embodiment of the invention, 

- Figure 3 illustrates in greater detail the block 
diagram of Figure 3 , and 

10 - Figure 4 is a general block diagram of an address 

generator according to the invention* 

Detailed description of a preferred embodiment of the 
invention 

By way of introduction to the description of an 
15 exemplary embodiment of the invention, the basic 
principles of operation of a turbo code internal 
interleaver as provided in the specification 3 GPP TS 
25-212 V3.5.0. (2002-12) Release 1999 will now be briefly 
recalled. 

20 Essentially, the turbo code internal interleaver 

consists of bits -input to a rectangular matrix with 
padding, intra-row and inter-row permutations of a 
rectangular matrix, and bits-output from the rectangular 
matrix with pruning, 

25 The bits input to the turbo code internal interleaver 

are denoted by x^x^,...,*^ , where K is the integer number 
of the bits (data items) to be interleaved and takes one 
value of 40 ^ K £ 5114. 

The relationship between the bits input to the turbo 
• 30 code internal interleaver and the bits input to the 

channel coding is defined by x k =o Mt and K = K±. 
Specifically, the following notations are used: 
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K Number of bits input to turbo code internal 

interleaver 

R Number of rows of rectangular matrix 

C . Number of columns of rectangular matrix 

5 p Prime number 

v Primitive root 

(sO')) 7€ {o,...p_ 2 } Base sequence for intra -row 

permutation 

g± Minimum prime integers 
10 ri Permuted prime integers 

(K0) fe {oi. ..a-i} Inter-row permutation pattern 

{ u i rW)y {oi c i) Intra " row permutation pattern of i-th row 

i Index of row number of rectangular matrix 

j Index of column number of rectangular matrix 

15 k Index of bit sequence 

As a first step, the bit sequence x x> x 29 x Z9 ... y x K input 
to the turbo code internal interleaver is written into the 
rectangular matrix by assigning to R the values 5, 10 and 
20 depending on the value of K. The rows of rectangular 
20 matrix are numbered 0, 1, , R - 1 from top to bottom, 

A second step involves determining the prime number 
to be used in the intra - permut a t i on , p, and the number of 
columns of rectangular matrix, C. This is done as a 
function of K and R 
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The columns of the rectangular matrix are numbered 0, 1, 
C - 1 from left to right. 

A third step involves writing the input bit sequence 
x l9 x 2 ,x 39 ... 9 x K into the RxC rectangular matrix row by row 
5 starting with bit y x in column 0 of row 0 where y k = x k for 
Je « 1, 2, K and if RxC>K, the dummy bits are padded such 
that y k =Qorl for k « K + 1, K + 2 , .RxC. 

These dummy bits are pruned away from the output of 
the rectangular matrix after intra -row and inter-row 
10 permutations . 

After the bits -input to the .RxC rectangular matrix, 
the intra-row and inter-row permutations for the RxC 
rectangular matrix are performed stepwise by using a six- 
step algorithm detailed in the Specification. 
15 ^ After intra-row and inter-row permutations, the bits 

of the permuted rectangular matrix are denoted by y 1 k . The 
output of the turbo code internal interleaver is the bit 
sequence read out column by column from the intra-row and 
inter-row permuted RxC rectangular matrix starting with 
20 bit y'x in row 0 of column 0 and ending with bit y'cR in 
row R - 1 of column C - 1 . 

The output is pruned by deleting dummy bits that were 
padded to the input of the rectangular matrix before 
intra-row and inter row permutations, i.e. bits y l k that 
25 corresponds to bits y k with k > K are removed from the 
output . 

The bits output from turbo code internal interleaver 
are denoted by x* lt x' 2 , ~ , x' K , where x' x corresponds to 
the bit y' k with smallest index k after pruning, x' 2 to 
30 the bit y* k with second smallest index k after pruning, 
and so on. 
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The number of bits output from the turbo code 
internal interleaver is K and the total number of pruned 
bits is R x C - K. 

All of the foregoing corresponds to processing 
5 criteria that are well known to those of skill in the art 
and, and such do not require to be further detailed 
herein. 

Essentially, the interleaving solution described in 
the technical Specification 3GPP TS 25.212 V3 - 5 . 0 . (2000- 
10 12) is comprised of three basic steps, namely: 

- constructing a matrix and inputting the incoming 
data to the matrix; 

- intra-row and inter-row permutations; 

- reading of the permutated data from the matrix 
15 obtained via intra-row and inter-row permutations. 

- The interleaver can be implemented using a data 
memory in which the incoming data are written sequentially 
and read in a specific order. 

In fact, any interleaving/de- interleaving process can 
20 be simply regarded as the result of either of the 
following operations: 

- a) taking from an input file the items (e.g. the 
bits) to be interleaved/ de- interleaved in the order they 
appear in such input file and re-arranging them in an 

25 output file at respective positions each identified by a 
given address; 

b) taking the items from an input file in a 
sequence identified by respective addresses and re- 
arranging them in an output file one after another as they 
30 are taken from the input file. 

Also, it will be appreciated that carrying out both 
operations indicated in the foregoing one after the other 
(in any order) , corresponds to a full interleaving/de - 
interleaving process. 
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Hereinafter, reference to an interleaving process 
will be made for the sake of simplicity, being otherwise 
understood that what is being stated in connection with 
interleaving applies directly also to de - interleaving . 
5 An interleaver module as well as a de-interleaver 

module can be easily implemented by resorting to the 
processing architecture corresponding to either solution. 

This has been schematically depicted in Figure 1, 
where reference numeral 10 denotes an address generator 

10 having an associated data memory 12 intended to co-operate 
with a turbo encoder/decoder (codec) 14. 

The codec 14 is intended to process (in a known 
manner, that does not require to be described in detail 
herein) data arranged in blocks, each block being 

15 comprised of K items to be interleaved/ de-interleaved . 

- The information related to the value of K (which, as 
indicated, may change with each different data packet 
transmitted/received) is sent from the codec 14 to the 
address generator 10. 

20 The data memory 12 is the basic element for 

performing the interleaving/de- interleaving operation. 

For instance this may occur by causing the codec 14 
to write the data into the memory 12 in a sequential order 
and then read the data from the memory 12 according to an 

25 address sequence defined by the address generator 10. 
Alternatively, the codec 14 may be caused to write the 
data into the memory 12 according to an address sequence 
defined by the address generator 10 and then read the data 
from the memory 12 in a sequential order. 

30 The address generator 10 described herein is 

essentially comprised of three units, namely: 

- a parameter generator 10a, 

- a permutation pattern generator 10b, and 

- an address generator 10c. 
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A possible general layout of a parameter generator 
10a for use in the arrangement disclosed herein is shown 
in figure 2. 

The layout shown is based on the recognition that the 
5 computational effort required for the generation of the 
parameters (R, C, p, v) needed for the construction of the 
interleaving matrix and the permutation patterns could be 
reduced if a look-up table (LUT) approach can be used* 

Stated otherwise, instead of positively generating 

10 (i.e. computing) such parameters whenever a new value of K 
is encountered (e.g. because the length - given by K - of 
the data blocks to be interleaved is changed) , one could 
contemplate the possibility of calculating those 
parameters - once for all - for all possible values of K, 

15 Such parameters could then be definitely stored in a 
memory to be selectively retrieved whenever a new value 
for K is to be managed. 

Such a solution will entail the significant advantage 
of permitting the address generator to be implemented 

20 essentially in hardware form, without the need of 
providing any specific computing software to be run for 
each new value for K, 

This will i.a. reduce processing time and thus 
represent a significant advantage as the interleaving 

25 process is already by itself relatively time consuming: 
the whole block of data to be interleaved is in fact to be 
made available before the interleaving process proper is 
started since the last input data to the process could be 
the first output data therefrom. 

30 In the look-up table arrangement considered, the 

input to the look-up table would then be the size of the 
data block incoming in the turbo encoder (K) . This may 
change in the range [40, 5114], that is assuming 5075 
possible different values. The concatenation of mappings 
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of different parameters <C, R, p, v) could comprise the 
output of the look-up table. The implementation of this 
look-up table could be a memory of 5075 records, each 
record including a respective set of parameters for each 
5 possible value of K. 

However advantageous, such a solution would not be 
quite efficient in view of the memory size required, and 
would inevitably penalize chip area occupation parameters 
that are critical e.g. for mobile applications such as 

10 UMTS mobile terminals. 

Implementing the reference algorithm described in the 
technical Specification 3GPP TS 25.212 V3.5.0. shows that, 
even if the possible values of K are 5075, the number of 
different records in the memory is much smaller than 5075, 

15 and in practice equal to about 200. 

^ As a result of this, a presently preferred embodiment 
of the arrangement shown herein provides for the memory 
comprising the look-up table considered in the foregoing 
being split in two different memories or memory areas, 

20 namely: 

a first memory area 20 essentially including 
storage locations for pointers addressed by the K value; 

- a second memory area 22 including (second) records 
for respective, different interleaving parameter sets, 
25 each addressed by a pointer. 

The number of such (second) records will be smaller 
than 5075, and in fact limited to about 200 (that is the 
number of possible different data sets resulting from the 
implementation of the algorithm defined in the 
30 Specification) . 

The arrangement will thus permit to identify, for 
each given value of K (5075 possible values) , a 
corresponding pointer to a respective record (i.e. a 
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respective set of interleaving parameters) in the second 

memory area. 4 

A further step in reducing the memory requirements 
can be taken by noting that the pointers will end up to be 
5 sorted in a growing order in the memory/ and thus orderly 
arranged in groups of adjacent pointers having the same 
most significant bits (MSB) . 

This feature can be used to further split the memory 
area 2 0 into two sub-units comprised of an MSB (Most 

10 Significant Bit) computation circuit 20a and a LSB (Least 
Significant Bit) pointer memory 20b. 

By resorting to that arrangement, a given number of 
MSBs for the pointers (for instance the first two bits in 
each pointer) can be determined by comparing the value of 

15 K with a number of thresholds in a group of comparators 
201^ 202, 203 and combining the outcome of these 
comparisons in a logic unit 204, The remaining, least 
significant bits (LSBs) can thus be retrieved from the LSB 
memory 2 0b, whose size will be correspondingly reduced. 

20 For instance, in the case considered of 5075 different 
values for K, the size of the memory 20b can be reduced 
from 5075x8 bits to 5075x6 bits since the two MSBs are 
already determined by the circuit 20a • 

By way of example, the thresholds of comparators 201, 

25 202, 203 may be set at values Tl, T2 and T3, respectively, 
with Tl lower than T2 and T2 in turn lower than T3 . 

The range of possible values for K will thus be 
partitioned in four portions or bands. Tl will be the 
upper bound for the first band, while also providing the 

30 lower bound for the second band, such second band lying 

u 

between Tl and T2 . The third band will lie between T2 and 
T3, while T3 will also represent the lower bound for the 
fourth portion or band. 
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Comparing the instant value of K against the 
% thresholds Tl, T2 and T3 in comparators 201 to 2 03 and 

processing the results of the respective comparisons in 
circuit 2 04 will therefore lead to locating K in one of 
5 those four bands and generating values 00, 01, 10 and 11/ 
respectively, for the two MSBs of the corresponding 
pointer . 

The pointer retrieval arrangement disclosed is thus 
comprised of: 

10 - a circuit 2 0a sensitive to the given number of 

items (K) to be interleaved/de- interleaved to derive 
therefrom the (two) most significant bits of the pointers, 
and 

- a further memory unit 2 0b having stored therein the 
15 (six) remaining, least significant bits of the pointers. 

- Of course a lower or a higher number of MSBs can be 
determined by the circuit 20a, the main aim being reaching 
the best trade off between memory occupation and 
combinational logic complexity. 
20 In order to reduce the computational effort required 

by the subsequent modules included in the interleaver, 
other information can be stored in the memory locations of 
the memory 20. 

For example a binary flag that is set to *1" when C 
25 is equal to p+1 . and K is equal to R*C can be useful to 
generate the output interleaving addresses without any 
extra complexity. Incidentally, such a binary flag 
essentially corresponds to the bit designated Type D in 
EP-A-1 195 910. 

30 Adding such a flag to the records in the memory 22 

will generally lead to an increase in the number of such 
records , in that otherwise identical records may have 
different values for the flag. 
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Experiments carried out by the Applicants show 
however that r even in this case r the number of records in 
the memory 22 will be slightly in excess of 200 , and 
typically equal to 219 (in comparison to the 5075 
5 different values for K) . 

Starting from the sets of parameters generated by the 
module 10a (these are in fact stored as respective records 
in the memory 22) , two kinds of permutation patterns must 
be computed* 

10 The first one, named vector e, is the intra- row 

permutation pattern. According to the standard procedure 
defined in specification 3GPP TS 25.212 V3.5.0, this 
vector is generated as a pseudo random sequence that uses 
as the initial seed the value v provided by the earlier 

15 stages of processing (that is, in the embodiment shown 
herein, the first module 10a) . 

In the arrangement shown herein, the pseudo random 
sequence (a*b mod c) is generated in module 10b by using a 
linear algorithm involving no multiplications or 

20 divisions. Such an algorithm is known in the area of 
cryptography as witnessed by the article *A Modulo 
Multiplication Hardware Design", by Adnan Gutub - Oregon 
State University - Electrical & Computer Engineering 
Department - ECE 575 Project Report - Winter 2000. 

25 Moreover, in the standard procedure defined in the 

specification 3 GPP TS 25.212 V3.5.0, in order to have a 

different permutation for each row, the s vector is 
» 

addressed by a pseudo random sequence and the seeds of the 
sequence are different for each row. 
30 Specifically, the algorithm described in the 

specification includes the generation of a sequence (r 
sequence) of seeds starting from another sequence (g 
sequence) of prime numbers. 
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As opposed thereto, in the arrangement shown herein, 
the- q sequence is directly used without the computation of 
the r sequence. Simply exchanging the order of the intra - 
row and inter-row permutations can do this. 
5 This optimisation allows reaching the same results 

without the inverse permutation described in section 
4.2.3.2.3.2-<4) of the 3GPP TS 25.212 V3 . 5 . 0 . 
Specification. 

For immediate reference this is the operation that 
10 requires the sequence -JW) to ^ subjected to 

permutation to make the sequence (n)^^..^ such that 

r r( i, = q± f i = ti, 1, R - 1, 

where ( r (0) fe { o , lt ...^-ij is the inter-row permutation pattern 

defined as the one of the four kinds of patterns shown in 
15 table 3 of 3 GPP TS 25.212 V3.5.0. as a function of the 
number of input bits K. 

This approach leads to avoiding an unnecessary 
reading procedure from a memory (not shown) in which the T 
values are stored* 
20 The sequence q can be generated by reading a prime 

numbers table, and a little look up table for the 
management of the greatest common divisor operation 
described in section 4 .2 . 3 . 2 . 3 . 2- (3) of the 3 GPP TS 25.212 
V3 . 5 . 0 . 

25 Again, for immediate reference, the operation in 

question assigns g 0 «= 1 to be the first prime integer in 
the sequence (tf/)/^,...^,} * and determines the prime integer q± 
in the sequence {9t) ic fp X „.^ to be a least prime integer such 

that g.c.d(qi, p - 1) ■ 1, a > 6, and q± > qr (i _ x > for each 
30 2, J8-1. Here g.c.d. is greatest common 

divisor. 
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Specifically, in the block diagram of figure 4, 
reference numeral 3 0 designates a block that, starting 
from the parameters derived from the block 10b, generates 
the vectors s and g. These vectors are stored in 
5 respective memory blocks designated 31 and 32 . 

Finally, reference 10c designates a block that 
implements the following relationship: 

X r = T(i) 

U c = sG*q(i)mod(p-l) 
10 Xc =f(U*> 

Address jnit = C* X r +X e 

where s, q, and p are the entities already identified 
in the foregoing, X r and Xc aire row and column indexes of 
15 the matrix before the permutation operations, i and j are 
the permuted matrix indexes, U c is the intra-row 
permutation parameter defined as (^W)^, c ^ in the 

Specification, and / is a function of U c which implements 
algorithm steps as described into section 4.2.3.2.3.2 -5 

20 of 3 GPP TS25.212 technical specifications. 

The output of the block 10c is represented by the 
interleaving/de- interleaving addresses to be used by the 
turbo codec 14 (see figure 1) . 

It will be appreciated that the technique described 

25 herein for obtaining the interleaving parameters such as 
R, C, p, v, flag (TypeD) directly from the value K is 
adapted to be used in conjunction with other arrangements 
for effecting inter-row and intra-row permutations already 
know in the art. 

30 Of course, without prejudice to the underlying 

principle of the invention, the details and embodiments 
may vary, even significantly, with respect to what has 



WO 2004/055992 



PCT/EP2002/014320 



17 

been described by way of example only, without departing 
from the scope of the invention as defined by the claims 
that follow. 



